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CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is related to a U.S. Provisional Patent Application entitled 
"HIGH AVAILABILITY COMPUTING SYSTEM AND METHOD" attorney 
10 docket reference 499.05 7PRV, filed on even date herewith. 

FIELD OF THE INVENTION 

This invention relates generally to computer communications, and more 
particularly to control and communication of highly available computing systems. 

15 

COPYRIGHT NOTICE/PERMISSION 

A portion of the disclosure of this patent document contains material which is 
subject to copyright protection. The copyright owner has no objection to the 
facsimile reproduction by anyone of the patent document or the patent disclosure as 
20 it appears in the Patent and Trademark Office patent file or records, but otherwise 

reserves all copyright rights whatsoever. The following notice applies to the 
software and data as described below and in the drawings hereto: Copyright © 2000, 
Silicon Graphics Incorporated, All Rights Reserved. 

25 BACKGROUND OF THE INVENTION 

Parallel computer systems provide economic, scalable, and high-availability 
approaches to computing solutions. From the point of view of managing computer 
systems including parallel-processor systems, there is a need for system 
administration and control in order to obtain desired system operation. 
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Conventional interfaces for various system-administration applications are 
unsatisfactory because they can be inconsistent, requiring the user to learn each one 
of a variety of different system-administration applications. Such system 
administration applications sometimes are not reliable or secure. Customers want to 
5 administer their system from a console of their choice, whether it's running Linux, 

IRIX, MacOS, Solaris, IRIX, Windows NT, or other OS. Typical system 
administration applications are hard to use and confusing for the non-technical user. 
Prerequisites aren't given up front or explained clearly, feedback is not given after 
each operation, and so forth. Applications that are not localized are difficult for non- 

10 English-speaking users to use. Applications that are not localized at the outset can be 

difficult for developers to retrofit with localization function calls. 

Conventionally, scalable parallel computing systems have management 
control software that is not scalable, and in which there are decreasing economies of 
scale as the number of computing units in the system increases. As the number of 

15 computing units in the parallel computing system increases, the overhead 

communications traffic increases not just geometrically, but in some cases, 
exponentially, creating increasing overhead burden in which an increasing amount of 
processing power is used merely by communicating between the computing units, 
and a proportionately lesser amount of processing power is available to support the 

20 application. Thus, the system management for scalable parallel processing systems 

have diminishing returns in terms of performance and cost. As a result, there is a 
need for system management software for scalable parallel processing systems that 
yields increasing scales of economies as the number of computing units is the system 
increases. 

25 For further background, the following pointers to on-line sources may be 

used: 

Linuxconf, a system administration API and application: 
litt p://www«solucorp . qc.ca/linuxconlY 
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Veritas Volume Manager Storage Administrator, a Java-based GUI: 

5 which are reproduced as attached Appendices L, M, and N. 

SGI sysadmdesktop, a Motif-based GUI for personal system administration: 

Sudo, a program that allows system administrators to give certain users the 
ability to run some commands as root or as another user while logging the commands 
10 and arguments: 

Microsoft wizards, step-by-step GUI guides, reminiscent of Rhino tasksets: 

1 5 SUMMARY OF THE INVENTION 

The present invention provides solutions to the above-described shortcomings 
in conventional approaches, as well as other advantages apparent from the 
description and appendices below. 

The present invention provides an API having a client-side API that is 

20 consistant across many computing platforms, and a server-side API that can be 

customized for a particular hardware platform. In some embodiments, the client-side 
API and code is in Java to provide portability, and the server-side API and code is in 
C++ customized to a particular platform, such as an SGI high-performance system, 
in order to provide speed and security. 

25 One aspect of the present invention provides an applications-programming 

interface (API) for high-availability computing in a computer system having at least 
one server and at least one client, the API including: a server-side API; a client-side 
API; a model having server-side priveleged-run (runpriv) and privileged-commands 
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(privemd) function such that only one program need be setuid-root on the server; an 
association class and three derived subclasses of ComputedAssoc, ChildAttrAssoc, 
and ParentAttrAssoc for use in server-side code and client-side code; and a graphical 
user interface client-side model, wherein the association class monitors a parent item, 
5 and the three subclasses are responsible for determining child items to be added, 

changed and deleted. 

Another aspect of the present invention provides an information handling 
system having stored thereon one of the APIs described above. 

Another aspect of the present invention provides a computer-readable storage 
10 medium having stored thereon one of the APIs described above. 

Another aspect of the present invention provides a single-image method for 
high-availability computing in a computer system having at least one server and at 
least one client, the method comprising: providing a server-side API; providing a 
client-side API; using a model having server-side priveleged-run (runpriv) and 
15 privileged-commands (privemd) function such that only one program need be setuid- 

root on the server; providing an association class and three derived subclasses of 
ComputedAssoc, ChildAttrAssoc, and ParentAttrAssoc for use in server-side code 
and client-side code; and providing a graphical user interface client-side model, 
wherein the association class monitors a parent item, and the three subclasses are 
20 responsible for determining child items to be added, changed and deleted. 

Another aspect of the present invention provides an article comprising a 
computer readable medium having instructions thereon, wherein the instructions, 
when executed in a computer, create a system for executing the method described 
above. 

25 The present invention provides the following advantages: 

Applications based on the present invention look and behave the same, 
because they use the same components. Thus users of one application based on the 
present invention can easily transfer to using another. The present invention is built 
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to be reliable and secure; its runpriv and privileged-commands (privemd) model 
means that only one program need be setuid-root on the server. 

1 . By using Java on the client side, Rhino-based applications can be run on any 
system having a Java virtual machine (Linux, IRIX, Windows NT, MacOS, 

5 and so on). Rhino-based applications can thus run as an applet in a web 

browser, or as a standalone application. 

2. Rhino embodies several principles of interaction that have been identified to 
confuse users. For details, please see hard-copy document titled, "Rhino 
Users Model," particularly "The Rhino users model in action" section. 

10 3. Rhino is localized, with all the user- visible strings kept separate from the 

GUI components, making it easy for Rhino-based applications to be 
localized. 

DETAILED DESCRIPTION OF THE INVENTION 

1 5 In the following detailed description of exemplary embodiments of the 

invention, reference is made to the accompanying drawings which form a part hereof, 
and in which is shown by way of illustration specific exemplary embodiments in 
which the invention may be practiced. These embodiments are described in 
sufficient detail to enable those skilled in the art to practice the invention, and it is to 

20 be understood that other embodiments may be utilized and that logical, mechanical, 
electrical and other changes may be made without departing from the scope of the 
present invention. The following detailed description is, therefore, not to be taken in 
a limiting sense, and the scope of the present invention is defined only by the 
appended claims. 

25 The present invention provides an infrastructure for system administration 

applications. One embodiment, called "Rhino" and available from Silicon Graphics, 



Inc. (SGI), the assignee of the present invention, is an infrastructure for building 
applications that configure, manage, and monitor hardware and software. Rhino 
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provides a common, consistent, task-based, internationalized, secure graphical user 
interface (GUI). Rhino-based applications have a command-line interface (CLI) that 
system administrators can use to write scripts. Rhino applications consist of a client- 
side GUI written in Java, a server-side daemon, server-side command-line interfaces 
5 written in C++ or C, and server-side libraries for monitoring the state of the system. 

Communications between the client and server are non-blocking and 
transparent to the application. Encryption is supported (pending export compliance 
approval) in some embodiments, as are security plugins. 

The following SGI products use Rhino: 
10 • FailSafe 2.0 GUI. Subsequent overlays have been shipped as patches, on a 

roughly quarterly basis since original shipment. 

CXFS GUI. First shipped with IRIX 6.5.6F. Bug fixes were made in IRIX 
6.5.7F. Subsequent releases are planned with IRIX 6.5. 8F and later releases. 
XVM GUI. In development, with planned initial release with IRIX 6.5. 10F. 
1 5 . Installation replicator for Linux. In development, with planned initial release 

with SGI Linux 1.3 (SGI ProPack 1.3 for Linux). 

Rhino was created to solve the following problems: 

Interfaces for various system administration applications can be inconsistent, 
20 requiring the user to learn each. 

System administration applications sometimes are not reliable or secure. 
Customers want to administer their system from a console of their choice, 
whether it's running Linux, IRIX, MacOS, Solaris, IRIX, Windows NT, or 
other OS. 

25 • Typical system administration applications are hard to use and confusing for 

the non-technical user. Prerequisites aren't given up front or explained 
clearly, feedback is not given after each operation, and so forth. 
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Applications that are not localized are difficult for non-English-speaking 
users to use. Applications that are not localized at the outset can be difficult 
for developers to retrofit with localization function calls. 

5 The present invention solves the problems in the following ways: 

Rhino-based applications look and behave the same, because they use the 
same components. Thus users of one Rhino-based application can easily 
transfer to using another. 

Rhino was built to be reliable and secure; its runpriv and privileged- 
10 commands (privemd) model means that only one program need be setuid-root 

on the server. 

By using Java on the client side, Rhino-based applications can be run on any 
system having a Java virtual machine (Linux, IRIX, Windows NT, MacOS, 
and so on). Rhino-based applications can thus run as an applet in a web 

1 5 browser, or as a standalone application. 

Rhino embodies several principles of interaction that have been identified to 
confuse users. For details, please see hard-copy document titled, "Rhino 
Users Model," particularly "The Rhino users model in action" section. 
Rhino is localized, with all the user- visible strings kept separate from the 

20 GUI components, making it easy for Rhino-based applications to be 

localized. 

There are other Rhino features that address the problems mentioned in the 
previous section. 

Rhino-based graphical user interfaces help to make system administration 
25 applications complete and truly usable by customers (ex., FailSafe, CXFS). Today, 

system administration applications are not considered professional quality unless 
they have a well-designed GUI, and Rhino makes it easy for developers to create 
well-designed GUIs for their system administration applications. 
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Also, having a common infrastructure makes it less time-consuming and 
expensive for developers to create additional system administration GUIs. This 
technology has broad applicability. 

Attached Appendix J provides a description of how to use one embodiment of 
5 the present invention. Attached Appendix K provides code used to implement one 

embodiment of the present invention. 

Conclusion 

The present invention provides an API having a client-side API that is 
10 consistant across many computing platforms, and a server-side API that can be 

customized for a particular hardware platform. In some embodiments, the client-side 
API and code is in Java to provide portability, and the server-side API and code is in 
C++ customized to a particular platform, such as an SGI high-performance system, 
in order to provide speed and security. 
1 5 One aspect of the present invention provides an applications-programming 

interface (API) for high-availability computing in a computer system having at least 
one server and at least one client, the API including: a server-side API; a client-side 
API; a model having server-side priveleged-run (runpriv) and privileged-commands 
(privemd) function such that only one program need be setuid-root on the server; an 
20 association class and three derived subclasses of ComputedAssoc, ChiidAttrAssoc, 

and ParentAttrAssoc for use in server-side code and client-side code; and a graphical 
user interface client-side model, wherein the association class monitors a parent item, 
and the three subclasses are responsible for determining child items to be added, 
changed and deleted. 

25 In some embodiments of the API, the ComputedAssoc subclass is for 

deriving classes that can be computed from values of one or more attributes of 
monitored child and parent items, and that monitors a parent item from a parent 
catagory and one or more items in a child catagory that are potential children. 
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In some embodiments of the API, the Chi Id Attr Assoc subclass is for deriving 
classes to represent relationships in which a child item stores selectors of one or 
more parent items as part of attributes of the ChildAttrAssoc subclass. 

In some embodiments of the API, the ParentAttrAssoc subclass is for 
5 deriving classes to represent relationships in which a parent item stores selectors of 

one or more child items as part of attributes of the ParentAttrAssoc subclass. 

In some embodiments of the API, the association is implemented in JAVA on 
the client side, enabling applications to be run on any system having a Java virtual 
machine. 

10 In some embodiments of the API, the API is localized, with all the user- 

visible strings kept separate from the GUI components, making it easy for Rhino- 
based applications to be localized. 

Some embodiments of the API further include an AssocFactory class that is a 
factory class for association objects, wherein AssocFactory methods are used to 
1 5 fulfill requests from remote clients. 

In some embodiments of the API, information is obtained about one or more 
associations and items by: obtaining a handle to an association instance; and 
obtaining information about item instances. 

Another aspect of the present invention provides an information handling 
20 sy stem having stored thereon one of the APIs described above. 

Another aspect of the present invention provides a computer-readable storage 
medium having stored thereon one of the APIs described above. 

Another aspect of the present invention provides a single-image method for 
high-availability computing in a computer system having at least one server and at 
25 least one client, the method comprising: providing a server-side API; providing a 

client-side API; using a model having server-side priveleged-run (runpriv) and 
privileged-commands (privemd) function such that only one program need be setuid- 
root on the server; providing an association class and three derived subclasses of 

9 
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ComputedAssoc, ChildAttr Assoc, and ParentAttrAssoc for use in server-side code 
and client-side code; and providing a graphical user interface client-side model, 
wherein the association class monitors a parent item, and the three subclasses are 
responsible for determining child items to be added, changed and deleted. 
5 Another aspect of the present invention provides an article comprising a 

computer readable medium having instructions thereon, wherein the instructions, 
when executed in a computer, create a system for executing the method described 
above. 

Another aspect of the present invention provides an information handling 
10 system having stored thereon instructions for executing the method. 

Another aspect of the present invention provides a computer-readable storage 
medium having stored thereon instructions for executing the method. 

It is to be understood that the above description is intended to be illustrative, 
and not restrictive. Although numerous characteristics and advantages of various 
1 5 embodiments of the present invention have been set forth in the foregoing 

description, together with details of the structure and function of various 
embodiments, many other embodiments and changes to details will be apparent to 
those of skill in the art upon reviewing the above description. The scope of the 
invention should, therefore, be determined with reference to the appended claims, 
20 along with the fuii scope of equivalents to which such claims are entitled. 



10 
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WHAT IS CLAIMED IS: 

1 . An applications-programming interface (API) for high-availability computing 
in a computer system having at least one server and at least one client, the API 

5 comprising: 

a server-side API; 
a client-side API; 

a model having server-side priveleged-run (runpriv) and privileged- 
commands (privemd) function such that only one program need be setuid-root on the 
10 server; 

an association class and three derived subclasses of ComputedAssoc, 
Chi Id Attr Assoc, and Parent Attr Assoc for use in server-side code and client-side 
code; and 

a graphical user interface client-side model, 
15 wherein the association class monitors a parent item, and the three subclasses are 

responsible for determining child items to be added, changed and deleted. 

2. The API of claim 1 , wherein the ComputedAssoc subclass is for deriving 
classes that can be computed from values of one or more attributes of monitored 

20 child and parent items, aiid dial monitors a parent item from a parent catagory and 

one or more items in a child catagory that are potential children. 

3. The API of claim 1, wherein the Chi Id Attr Assoc subclass is for deriving 
classes to represent relationships in which a child item stores selectors of one or 

25 more parent items as part of attributes of the ChildAttrAssoc subclass. 
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4. The API of claim 1 , wherein the Parent Attr Assoc subclass is for deriving 
classes to represent relationships in which a parent item stores selectors of one or 
more child items as part of attributes of the ParentAttrAssoc subclass. 

5 5. The API of claim 1, wherein the association is implemented in JAVA on the 

client side, enabling applications to be run on any system having a Java virtual 
machine. 

6. The API of claim 1 , wherein the API is localized, with all the user-visible 
10 strings kept separate from the GUI components, making it easy for Rhino-based 

applications to be localized. 

7. The API of claim 1, further comprising an AssocFactory class that is a 
factory class for association objects, wherein AssocFactory methods are used to 

1 5 fulfill requests from remote clients. 

8. The API of claim 1, wherein information is obtained about one or more 
associations and items by: 

obtaining a handle to an association instance; and 
20 obtaining information about item instances. 

9. An information handling system having stored thereon the API of claim 1. 

1 0. A computer-readable storage medium having stored thereon the API of claim 
25 1. 
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11. A method of providing a single-image model for high-availability computing 
in a computer system having at least one server and at least one client, the method 
comprising: 

providing a server-side API; 
5 providing a client-side API; 

using a model having server-side priveleged-run (runpriv) and privileged- 
commands (privemd) function such that only one program need be setuid-root on the 
server; 

providing an association class and three derived subclasses of 
10 ComputedAssoc, ChildAttrAssoc, and ParentAttrAssoc for use in server-side code 

and client-side code; and 

providing a graphical user interface client-side model, 
wherein the association class monitors a parent item, and the three subclasses are 
responsible for determining child items to be added, changed and deleted. 



15 



12. An article comprising a computer readable medium having instructions 
thereon, wherein the instructions, when executed in a computer, create a system for 
executing the method of claim 1 1 . 



20 13. An information handling system having stored thereon msti actions fur 

executing the method of claim 1 1 . 

14. A computer-readable storage medium having stored thereon instructions for 
executing the method of claim 1 1 . 

25 
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ABSTRACT OF THE DISCLOSURE 

An applications-programming interface (API) for high-availability computing 
5 in a computer system having at least one server and at least one client, the API 

including: a server-side API; a client-side API; a model having server-side 
priveleged-run (runpriv) and privileged-commands (privemd) function such that only 
one program need be setuid-root on the server; an association class and three derived 
subclasses of ComputedAssoc, ChildAttrAssoc, and ParentAttrAssoc for use in 
10 server-side code and client-side code; and a graphical user interface client-side 

model, wherein the association class monitors a parent item," and the three subclasses 
are responsible for determining child items to be added, changed and deleted. 
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Introduction 

The various pieces of cluster software each have a number of configurable parameters that must be 
set by the system administrator. In the past these parameters had been maintained in several ASCII 
text files. However, this is no longer adequate for several reasons: 

• A cluster may now consist of 8 or more nodes, which could make a text configuration file large 
and awkward to work with. 

• The configuration information must generally be available to all of the nodes in a cluster. 
Historically this has been dealt with by manually copying the configuration file from one node 
to the other, but this will no longer be practical if there are many nodes in a cluster. 

• It will soon be necessary to be able to update the configuration from other programs (e.g. 
Cluster Manager). It is difficult to maintain the validity of data handled in this way if it can 
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Introduction 

The configuration and administration of SGI cluster systems will be done in at least two different ways: 

1 . Using a graphical user interface, known tentatively by names such as ,f Chister Manager" or "FailSafe 
Manager". This will be referred to as "The GUI" and will be provided by the System Administration 



2. Using a command line interface, similar in style to tools such as xiv_mgr. This will be referred to as f The 
CLI" and will be provided by the HA Infrastructure group. 

To avoid duplicated effort, both of these tools will use the same mechanisms for effecting changes and 
monitoring status in the cluster. This document describes those common mechanisms at an architectural level 
Specific implementation details are described [elsewhere] . 

Commands 

Most cluster operations ultimately involve the relatively small, discrete tasks which are described in the Cluster 
Administration Tasks document. Each task will be performed by one of a number of specialized unix task 
commands (provided by the HA Infrastructure group) that will be executed in a separate process by the GUI 
or CLI. 
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Introduction 

Cluster Administration Services (CAS) provides a variety of different services, both natively (e.g. CAS 
messaging) and in the form of phxgin services (e.g. the Cluster Adminstration Monitor [CAM]). Underlying these 
services is a low- level infrastructure that provides such features as tagging and storage management These are 
referred to collectively as the "CAS Basic Services", 

This document describes these basic services provided by CAS and the API used to access them. For 
information on other aspects of CAS, see: 

• Cluster Administration Services - Messaging 

• Cluster Administration Services - Phigins 
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Introduction 

By itself Cluster Administration Services (CAS) is little more than a message bus, routing data between 
different pieces of cluster software. Rather than provide additional features on its own, CAS relies on plugin 
modules to extend its functionality. CAS plugin modules (also referred to simply as plugins) not only make new 
services available to client programs, but also provide new communication mechanisms and message types as 
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